Considering Error Propagation in Stepwise Polynomial Regression
نویسندگان
چکیده
The selection of an optimal regression model comprising linear combinations of various integer powers of an independent variable (explanatory variables) is considered. The optimal model is defined as the most accurate (minimal variance) stable model, where all parameter estimates of the orthogonalized explanatory variables are significantly different from zero. The potential causes that limit the number of terms that can be included in a stable regression model are investigated using two indicators, which measure signal-to-noise ratios in the variables. The truncation-to-noise ratio indicator is used to measure the extent of collinearity between the explanatory variables and the correlation-to-noise ratio indicator to evaluate the significance of the correlation between an explanatory variable and the dependent variable. It is shown that the number of terms that can be included in a stable polynomial model (and its accuracy) depend on the range and precision of the data, the rate of the error propagation during computations, and the algorithm used to calculate the regression parameters. It is demonstrated that it can often be advantageous to include nonconsecutive powers of the independent variable in an optimal polynomial model. An orthogonalized-variable-based stepwise regression procedure is presented, which enables identifying the optimal model in polynomial regression.
منابع مشابه
Orthogonal Bases for Polynomial Regres- Sion with Derivative Information in Uncer- Tainty Quantification
We discuss the choice of polynomial basis for approximation of uncertainty propagation through complex simulation models with capability to output derivative information. Our work is part of a larger research effort in uncertainty quantification using sampling methods augmented with derivative information. The approach has new challenges compared with standard polynomial regression. In particul...
متن کاملAPPLICATION OF EVOLUTIONARY POLYNOMIAL REGRESSION IN ULTRAFILTRATION SYSTEMS CONSIDERING THE EFFECT OF DIFFERENT PARAMETERS ON OILY WASTEWATER TREATMENT
In the present work, the effects of operating conditions including pH, transmembrane pressure, oil concentration, and temperature on fouling resistance and the rejection of turbidity for a polymeric membrane in an ultrafiltration system of wastewater treatment were studied. A new modeling technique called evolutionary polynomial regression (EPR) was investigated. EPR is a method based on regres...
متن کاملA software toolbox for data analysis and regression, considering data precision and numerical error propagation
An algorithm for data analysis and regression by orthogonalized-variable-based stepwise regression (SROV) has been developed and was implemented as a MATLAB toolbox. The program uses QR decomposition based on Gram-Schmidth orthogonalization, which is highly resilient to numerical error propagation, for regression. Variables are selected to enter the regression model according to their level of ...
متن کاملConsidering precision of experimental data in construction of optimal regression models
Construction of optimal (stable and of highest possible accuracy) regression models comprising of linear combination of independent variables and their non-linear functions is considered. It is shown that estimates of the experimental error, which are most often available for engineers and experimental scientists, are useful for identifying the set of variables to be included in an optimal regr...
متن کاملIdentifying and removing sources of imprecision in polynomial regression
Identification and removal of imprecision in polynomial regression, originating from random errors (noise) in the independent variable data is discussed. The truncation error-to-noise ratio (TNR) is used to discriminate between imprecision dominated by collinearity, or numerical error propagation, or inflated variance due to noise in the independent variable. It is shown that after the source o...
متن کامل